Formant analysis using mixtures of Gaussians

نویسندگان

  • Parham Zolfaghari
  • Tony Robinson
چکیده

This paper describes a new formant analysis technique whereby the formant parameters are represented in the form of Gaussian mixture distributions. These are estimated from the Discrete Fourier Transform (DFT) magnitude spectrum of the speech signal. The parameters obtained are the means, variances and the masses of the density functions, which are used to calculate centre frequencies, bandwidths and amplitudes of formants within the spectrum. In order to better fit the mixture distributions various modifications to the DFT magnitude spectrum, based on simple models of perception, were investigated. These include reduction of dynamic range, cepstral smoothing, use of the Mel scale and pre-emphasis of speech. Results are presented for these as well as formant tracks from analysing speech using the final formant analysis system.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A segmental formant vocoder based on linearly varying mixture of Gaussians

MIXTURE OF GAUSSIANS Parham Zolfaghari and Tony Robinson Cambridge University Engineering Department, Trumpington Street, Cambridge CB2 1PZ, UK. Tel: [+44] 1223 332754 Fax: [+44] 1223 332662 email : psz1000,[email protected] ABSTRACT This paper describes a low bit-rate segmental formant vocoder. The formants are estimated using mixture of Gaussians whose means are constrained to vary linearly w...

متن کامل

A formant vocoder based on mixtures of Gaussians

Parham Zolfaghari Tony Robinson Cambridge University Engineering Department, Trumpington Street, Cambridge CB2 1PZ, UK. Tel: [+44] 1223 332754 Fax: [+44] 1223 332662 email : psz1000,[email protected] ABSTRACT This paper describes a new low bit-rate formant vocoder. The formant parameters are represented by Gaussian mixture distributions, which are estimated from the discrete Fourier transform (...

متن کامل

Design of compact acoustic models through clustering of tied-covariance Gaussians

We propose a new approach for designing compact acoustic models particularly suited to large systems that combine multiple model sets to represent distinct acoustic conditions or languages. We show that Gaussians based on mixtures of inverse covariances (MIC) with shared parameters can be clustered using an efficient Lloyd algorithm. As a result, more compact acoustic models can be built by clu...

متن کامل

Adaptation of children's speech with limited data based on formant-like peak alignment

Automatic recognition of children s speech using acoustic models trained by adults results in poor performance due to differences in speech acoustics. These acoustical differences are a consequence of children having shorter vocal tracts and smaller vocal cords than adults. Hence, speaker adaptation needs to be performed. However, in real-world applications, the amount of adaptation data availa...

متن کامل

Formant model estimation and transformation for voice morphing

In this paper we consider the estimation and mapping of timevarying formant model parameters and orders for voice transformation. The model order is the number of perceptually significant formant trajectories estimated from an analysis of the poles of “over-modelled’’ linear prediction models of the source and target speech. A 2-D HMM with NF left–to-right states across frequency and M states a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996